Tools for graph mining
نویسنده
چکیده
Large real-world graphs often show interesting properties, such as power-law degree distributions and very small diameters. Discovering such patterns and regularities has a wide range of potential applications. It could help us with detecting outliers or abnormal subnetworks (such as terrorist networks or illegal money-laundering rings), maximizing e ciency of disease controlling, marketing, forecasting and simulations, to name a few. A graph generating model, the recursive matrix (R-MAT) model is introduced and a method (AutoMAT) is shown for automatically estimating the input parameters for R-MAT in order for it to match a given realworld graph. Using these parameters, the resulting R-MAT graphs are shown to match many properties of real graphs. Also, a set of plots (A-plots) are introduced as original ways for viewing large graphs. Their applications in nding interesting patterns and outliers in real, large graphs are demonstrated.
منابع مشابه
Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining
Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...
متن کاملA Systematic Mapping Study on the Usage of Software Tools for Graphs within the EDM Community
The field of educational data mining (EDM) has been slowly expanding to embrace various graph-based approaches to interpretation and analysis of educational data. However, there is a great wealth of software tools for graph creation, visualization, and analysis, both general-purpose and domain-specific, which may discourage EDM practitioners from finding a tool suitable for their graph-related ...
متن کاملStructural graph-based representations used for finding hidden patterns
In graph-based data mining (GBDM) tasks, an accurate data representation is fundamental for finding hidden patterns. However, there does not exist a standard representation to describe structural data because of the specific domain characteristics. Then, different graph topologies could be used as data representation, which is a challenge for GBDM tools. In this paper we explore a methodology f...
متن کاملGraph-based data mining: A new tool for the analysis and comparison of scientific domains represented as scientograms
The creation of some kind of representations depicting the current state of Science (or scientograms) is anestablishedandbeaten track formanyyearsnow.However, ifweare concernedwith the automatic comparison, analysis andunderstanding of a set of scientograms, showing for instance the evolution of a scientific domain or a face-to-face comparison of several countries, the task is titanically compl...
متن کاملAnalysis of the Time Evolution of Scientograms Using the Subdue Graph Mining Algorithm
Scientograms are a kind of graph representations depicting the state of Science in a specific domain. The automatic comparison and analysis of a set of scientograms, to show for instance the evolution of a scientific domain of a given country, is an interesting but challenging task as the handled data is huge and complex. In this paper, we aim to show that graph mining tools are useful to deal ...
متن کاملOn the Integration of Graph Exploration and Data Analysis: The Creative Exploration Toolkit
To enable discovery in large, heterogenious information networks a tool is needed that allows exploration in changing graph structures and integrates advanced graph mining methods in an interactive visualization framework. We present the Creative Exploration Toolkit (CET), which consists of a state-of-the-art user interface for graph visualization designed towards explorative tasks and support ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004